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INTRODUCTION 

Invasive  lobular  cancer  (ILC)  differs  both  morphologically  and  clinically  from  the  more  eommon 
ductal  (IDC)  type  of  breast  eaneer.  ILC  is  more  likely  to  be  steroid  hormone  receptor  positive,  to 
have  a  low  proliferation  rate,  to  be  diploid  and  to  exhibit  loss  of  e-cadherin  protein  expression. 
Loss  of  function  of  tumor  suppressor  genes  or  level  of  genomie  instability  may  differ  as  well,  but 
studies  to  date  have  examined  relatively  few  lobular  cases  at  only  limited  regions  of  the  genome. 
In  this  study  we  are  eonducting  a  genome-wide  assessment  of  allelic  loss,  or  loss  of 
heterozygosity  (LOH),  using  a  microarray  (Affymetrix  lOK  SNP  assay)  composed  of  10,000 
single  nucleotide  polymorphisms  (SNPs)  to  evaluate  the  rate  of  allelic  loss  on  individual 
chromosome  arms  in  60  each  of  lobular  and  ductal  breast  cancers  obtained  from  a  population- 
based  study  (NCI  ROl  CA85913,  P.I.  Janet  Baling).  Data  from  the  parent  study  will  allow  us  to 
correlate  allelie  loss  findings  with  risk  faetors  such  as,  hormone  replacement  therapy  (HRT)  use, 
and  other  tumor  charaeteristics  such  as  estrogen  and/or  progesterone  receptor  status.  A 
comparison  of  genome-wide  allelie  loss  in  ILC  and  IDC  will  provide  new  information  about 
regions  of  genetic  loss  and  genomic  instability  associated  with  particular  phenotypic  traits. 
Information  about  morphology-specific  traits  gained  from  studying  a  large  number  of  lobular 
eaneers  will  lead  to  an  inereased  understanding  of  the  biology  of  distinct  subsets  of  breast  cancer, 
and  provide  a  basis  for  future  studies  that  would  define  patient  stratification  into  prognostic  and 
treatment  groups  and/or  inform  the  development  of  targeted  therapies  for  specific  tumor  types. 

BODY 

The  following  deseribes  the  research  accomplishments  associated  with  each  item  in  the  approved 
Statement  of  Work. 

Task  1 :  Selection  of  60  lobular  and  60  ductal  cases,  (months  1-14) 

We  have  compiled  data  on  women  in  the  parent  study  as  they  are  enrolled.  We  use  an  algorithm 
that  seleets  subjects  based  on  number  of  flow  cytometrically  sorted  tumor  cells  available, 
estrogen  receptor  status  of  tumor,  stage  of  disease  and  age  (see  appendix  for  protocol).  To  date, 
55  ILC  and  60  IDC  tumors  have  been  identified.  DNA  has  been  extracted  from  the  tumors  and 
from  the  blood  lymphocytes  for  the  subject. 

Task  2:  Prepare  DNA  for  HuSNP™  analysis  (months  6-22) 

DNA  is  stored  at  -80°C  until  all  samples  are  collected.  Whole-genome  amplification  (PEP)  will 
be  performed  in  batches  when  all  120  samples  are  available. 

Task  3:  1  OK  array  analysis  months  (9-22) 

This  task  was  delayed  due  to  delayed  release  of  the  Affymetrix  product  but  we  have  been  able  to 
validate  the  array  in  collaboration  with  scientific  staff  at  Affymetrix.  The  new  array  with  over 
1 1,000  SNPs  has  been  tested  and  a  total  of  40  tumors  have  been  analyzed  with  the  assay.  We 
find  very  high  pass  rates  (an  indication  that  valid  data  are  generated  at  for  an  individual  SNP)  for 
the  fixed  samples  in  our  study.  We  have  compared  pass  rates  for  fixed  and  frozen  tumor  samples 
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and  find,  although  there  is  a  decrease  in  the  average  pass  rate  for  the  fixed  samples,  the  data  that 
are  generated  correlate  almost  perfectly  with  the  data  from  fresh  frozen  samples.  We  have  also 
compared  the  data  from  the  assay  with  data  from  array  comparative  genomic  hybridization 
(CGH)  on  the  same  tumors.  We  find  almost  complete  concordance  with  LOH  as  defined  by  the 
SNP  array  and  loss  of  copy  number  as  defined  by  the  Bacterial  Artificial  Chromosome  (BAC) 
clone  array  we  are  using  for  array  CGH.  There  are  regions  of  LOH  that  do  not  correspond  to 
copy  number  loss  and  most  likely  reflect  genomic  events,  such  as  mutation,  which  results  in 
LOH  without  loss  of  copy  number  change.  We  will  continue  to  evaluate  data  that  are  generated 
by  the  lOK  array  by  additional  methods  to  validate  LOH  at  specific  sites. 

Task  4.  Data  analysis  (months  9-24) 

Analysis  of  data  from  individual  tumors  is  being  conducted  now  and  data  analysis  to  determine 
the  differences  in  LOH  between  ductal  and  lobular  cancers  will  begin  after  we  complete  all 
assays  on  the  120  tumors. 

Task  5.  Publication  of  results  (months  22-24) 

We  have  not  completed  the  analysis  of  the  lOK  and  so  have  not  prepared  manuscripts  related  to 
the  LOH  data.  We  have  identified  differences  in  lobular  and  ductal  cancers  using  array  CGH  and 
currently  have  a  manuscript  in  revision  at  Cancer  Research(l). 

KEY  RESEARCH  ACCOMPLISHMENTS 

•  validation  of  the  prototype  HuSNP  platform  for  use  in  fixed  tissue  samples(2) 

•  validation  of  the  new  higher  density  10,000  SNP  array 

•  enrollment  and  identification  of  subjects  for  study  using  10,000  SNP  arrays 

•  flow  cytometry  cell  sorting  of  tumors  for  study 

•  establishment  of  new  array  platform  for  collection  of  data  from  10,000  SNPs  in  the  Porter  lab 
and  the  array  facility  at  the  FHCRC 

•  evaluation  of  40  tumors  using  the  lOK  SNP  array 

REPORTABLE  OUTCOMES 

Schubert  EL,  Malone  K,  Baling  JD,  Cousens  LG,  Porter  PL.  Whole  genome  LOH  analysis  of 
lobular  and  ductal  breast  cancers  by  HuSNP™  array.  Poster  24th  Annual  San  Antonio  Breast 
Cancer  Symposium,  2001. 

Schubert  EL ,  Hsu  L,  Cousens  LA,  Glogovac  J,  Self  S,  Reid  BJ,  Rabinovitch  P,  Porter  PL. 
Single  nucleotide  polymorphism  array  analysis  of  flow-sorted  epithelial  cells  from  frozen  versus 
fixed  tissues  for  whole  genome  analysis  of  allelic  loss  in  breast  cancer.  Am  J  Path,  160(1):  73- 
79,  2002. 

Loo  L,  Grove  D,  Neal  C,  et  al.  Array  CGH  Analysis  of  Genomic  Alterations  in  Breast  Cancer 
Sub-Types.  In  revision.  Cancer  Res,  2004. 
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CONCLUSIONS 

With  the  establishment  of  the  array  platform  at  the  FHCRC,  the  identification  of  subjects  and  the 
processing  of  DNA  from  selected  tumors,  we  are  close  to  the  data  collection  phase  of  the  study. 
We  will  generate  new  data  concerning  the  molecular  changes  associated  with  lobular  and  ductal 
breast  cancer  that  will  lead  to  an  increased  understanding  of  the  biologic  differences  between 
these  subsets.  Most  optimistically,  our  findings  could  inform  the  development  of  targeted 
detection  and  therapy  strategies  for  specific  tumor  types. 
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SHARE  /  LOBULAR  HuSNP  PROJECT 

Procedure  for  IDENTIFICATION  OF  ELIGIBLE  SUBJECTS,  SPECIMEN  TRANSFER, 
AND  DNA  EXTRACTION  OF  BUFFY  COATS  COLLECTED  AS  PART  OF  THE 
SHARE  STUDY 


All  tissue  requests  and  blood  collections  for  the  SHARE  study  originate  with  CERC  Studies 
staff.  The  Porter  Lab  will  receive  the  Formalin-fixed,  paraffin-embedded  tumor  tissue  in  the  lab 
and  test  DNA  extracted  from  flow-sorted  tumor  cells  against  a  normal  (DNA  extracted  from  the 
blood  lymphocytes  by  the  Specimen  Processing  Lab)  for  each  of  120  eligible  subjects  (60 
invasive  ductal  carcinomas  (8500/3),  and  60  invasive  lobular  carcinomas  (8520/3)  to  be  tested  by 
HuSNP  array.  The  selection  criteria  for  inclusion  into  the  HuSNP  project  are:  enrollment  in 
SHARE  study,  imilateral  breast  cancer,  diagnosis  date  between  January  1  2000  and  June  30 
2001,  Stage  II  or  lower,  ER  positive,  minimum  50K  flow-sorted  tumor  cells  available,  and  blood 
sample  collected  and  stored  by  CERC. 

1.  CASE  SELECTION  /  SUBJECT  IDENTIFICATION 

a.  Identification  of  potential  subjects  by  the  Porter  Lab 

Tumor  tissue  is  received  by  the  Porter  Lab.  After  histological  review  of  each  tumor,  IHC 
and  flow  cytometry,  the  Porter  Lab  identifies  ER  positive,  pure  lobular  and  pure  ductal 
invasive  breast  tumors  from  the  SHARE  cohort  for  which  a  minimum  of  50,000  sorted 
cells  have  been  stored.  Approximately  once  every  3  months,  the  Porter  Lab  prepares  a  list 
of  the  tumors  meeting  these  criteria,  identified  by  Study  ID  along  with  the  correlating 
histological  dx  (per  Porter  Lab  pathologists)  for  each  case.  The  list  is  sent  via  courier,  on 
a  disc,  to  CERC  Programmer. 

b.  Selection  of  subjects  by  CERC 

The  potential  subjects  identified  by  the  Porter  lab  are  screened  by  CERC  programmer  for 
additional  eligibility  parameters  (dx  date,  stage,  laterality,  and  current  blood  sample  availability). 
He  will  also  consider  the  histology  code  sent  by  the  Porter  Lab,  in  the  interest  of  acquiring  60 
suitable  cases  of  each  diagnosis  (invasive  ductal  and  invasive  lobular).  Patients  eliminated  by 
any  criterion,  or  any  who  have  refused  blood  donation,  are  noted  as  permanently  ineligible. 
Subjects  from  whom  blood  collection  has  not  yet  occurred  are  noted  as  temporarily  ineligible. 
Any  of  these  for  whom  blood  is  later  collected  are  included  with  the  next  group  of  specimens, 
after  CERC  Study  Manager  notifies  CERC  Programmer,  who  changes  their  status  to  “eligible” 
and  flags  them  for  inclusion  in  the  next  group. 

c.  Preparation  of  labels  and  tracking  lists 


CERC  Programmer 
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(1)  sends  CERC  Study  Manager  an  email  notifying  her  of  any  “temporarily 
ineligible”  subjects  for  blood  collection  follow-up.  He  also  sends  this  information 
to  Porter  Lab. 

(2)  generates  a  list  of  the  Study  IDs  and  Buffy  Coat  IDs  of  the  eligible  participants 
requested,  which  he  downloads  onto  a  disc  and  sends  via  courier  to  Porter  Lab. 

(3)  emails  a  “Buffy  Coat  Pack  List”  to  CERC  Study  Manager  listing  the  Buffy  Coat 
ID’s  of  the  specimens  to  be  pulled  and  sent  to  the  Specimen  Processing  Lab. 

(4)  prints  2  identical  tube  labels  for  each  blood  specimen,  on  which  is  printed  the  5 
digit  “Buffy  Coat  ID”  number  (with  which  the  buffy  coat  is  already  also  labeled) 
and  a  CERC  “DNA  Aliquot  ID”  number,  which  has  been  previously  set  aside  for 
this  purpose. 

•  CERC  Data  Coordinator 

(1)  prints  several  sheets  of  duplicate  sequential  “DNA  Aliquot  ID”  labels.  The  ID 
numbers  are  prefaced  by  the  letters  “SHA”.  Half  of  these  sheets  of  labels  are 
printed  in  bold  red  type  with  the  suffix  “QIA”  following  the  ID  number  on  the 
label.  The  other  labels  are  printed  in  standard  black  type  with  the  suffix  “PHE” 
following  the  number  on  the  label. 

(2)  prints  the  “SHARE  DNA  Tracking  Log  Forms”,  preparing  one  for  each  Buffy 
Coat  ID,  pre-printing  the  DNA  Aliquot  ID  to  appear  on  the  Porter  Lab’s  5  pg 
aliquot  on  the  form  in  the  “DNA  Aliquot  ID  for  Porter  Lab”  field,  as  well  as  the 
Buffy  Coat  ID. 

•  Cancer  Biology  Division  Programmer  downloads  the  list  of  eligible  subjects 

(received  from  CERC  Programmer)  into  the  Porter  Lab  SHARE  database. 

•  Porter  Lab  Data  Coordinator 

(1)  picks  up  the  disc  (if  any)  from  the  courier  and  delivers  it  to  Stephanie  Stafford. 

(2)  prepares  5  tube  labels  with  the  Porter  Lab’s  “Lab  Number”,  scheduled  extraction 
and  aliquotting  date,  and  the  correlating  Buffy  Coat  ID,  for  each  eligible  subject 
listed. 

•  CERC  Study  Manager 

(1)  attaches  the  two  special  labels  which  match  each  Buffy  Coat  ID  to  each 
correlating  “SHARE  DNA  Tracking  Log  Form”  with  a  paperclip. 

(2)  pulls  the  Buffy  Coats  listed  on  the  “Buffy  Coat  Pack  List”  ”  from  the  freezer  and 
packs  them  carefully  in  dry  ice  in  a  cooler  as  described  below. 

(3)  sends  the  Forms  and  Aliquot  ID  labels,  specimens  and  Pack  List  to  the  Specimen 
Processing  Lab  as  described  below 


2.  SPECIMEN  TRANSFER 

Study  Manager  contacts  Porter  Lab  via  phone  or  email  after  receiving  the  “Buffy  Coat  Pack 
List”  to  let  the  lab  know  exactly  when  to  send  the  blood  specimens  to  the  Specimen 
Processing  Lab.  Porter  Lab  then  contacts  the  Specirnen  Processing  Lab  to  schedule  the 


Assistance  Award  No.  DAMD1 7-02-1 -0386 


extractions  at  a  time  when  Porter  Lab  staff  can  pick  up  Porter  Lab  aliquot  immediately  after 
extraction.  Study  Manager  then  pulls  the  appropriate  blood  specimens  from  the  freezer  and 
packs  them  in  a  cooler  in  dry  ice.  She  includes  the  forms  and  labels  prepared  by  CERC 
programmer  in  the  package  and  labels  everything  clearly  with  “VIA  COURIER,  To:  The 
Specimen  Processing  Lab,  DE-120,  Day  Campus,  x  4645,  Specimens  Enclosed,  Keep 
Frozen,  Contact:  SHARE,  Porter  Lab,  x5470”.  The  package  is  then  left  with  the  receptionist 
at  the  Met  Park  2"**  floor  Reception  Desk  for  the  next  courier  run.  Alternatively,  the 
specimens,  forms,  and  labels  are  delivered  to  the  Specimen  Processing  Lab  by  CERC  Staff. 

3.  EXTRACTION  /  ELUSION  /  LAB  TRACKING 

The  Specimen  Processing  Lab  will: 

•  Perform  the  extractions  at  the  scheduled  time,  notifying  the  Porter  Lab  of  any 
schedule  changes  by  calling  Porter  Lab  at  667-5470  or  667-6187. 

•  Extract  one  quarter  of  each  huffy  coat  via  the  Qiagen  method,  and  three  quarters  by 
hand  using  the  phenol  chloroform  method. 

•  Indicate  the  date  of  the  extraction,  the  name  of  the  Specimen  Processing  Lab  Tech 
performing  the  extraction,  the  Elution  used,  and  total  volume  and  total  pg  extracted, 
and  the  final  concentration  on  the  “SHARE  DNA  Tracking  Log  Form”  for  each  huffy 
coat,  for  each  extraction  performed. 

•  Perform  the  extractions,  and  aliquot  all  resulting  material  from  the  Qiagen  method 
into  5  pg  aliquots  and  all  resulting  material  from  the  Phenol  chloroform  method  into 
Master  and  Children  aliquots  according  to  that  protocol. 

•  Phone  Porter  Lab  at  667-5470  or  667-6187  to  notify  them  that  the  extractions  are 
complete 

•  Ensure  that  the  first  aliquot  of  DNA  purified  by  the  Qiagen  method  from  each  Buffy 
Coat  contains  a  total  of  5  pg  of  DNA  where  possible,  at  a  concentration  of  50  ng/pl, 
designating  this  aliquot  for  the  Porter  Lab.  The  Porter  Lab  aliquot  remains 
UNFROZEN. 

•  Label  this  first  aliquot  of  DNA  purified  by  the  Qiagen  method  from  each  Buffy  Coat 
for  the  Porter  Lab,  using  the  provided  label  (attached  to  the  correlating  “SHARE 
DNA  Tracking  Log  Form”  with  a  paperclip),  which  has  both  the  originating  Buffy 
Coat  ID  number  and  a  regular  “DNA  Aliquot  ID”  printed  on  it.  One  such  label  goes 
onto  the  microcentrifuge  tube,  and  the  other  matching  label  with  the  same  DNA 
Aliquot  ID  goes  onto  the  “SHARE  DNA  Tracking  Log  Form”. 

•  Label  all  other  aliquots  with  a  standard  “DNA  Aliquot  ID”  label  (provided),  which 
begins  with  the  letters  “SHA”.  One  such  label  is  stuck  onto  the  tube,  and  the  other 
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matching  label  with  the  same  DNA  Aliquot  ID  is  stuck  onto  the  “SHARE  DNA 
Tracking  Log  Form”  on  the  line  for  that  aliquot. 

•  Complete  additional  fields  on  the  “SHARE  DNA  Tracking  Log  Form”  (“DNA  Amt”, 
DNA  Cone”  “Aliquot  Vol”,  and  “Box  Name  or  number”)  for  each  aliquot. 

•  Temporarily  store  the  Porter  Lab’s  aliquot  in  a  separate  box  marked  “SHARE  Qiagen 
Porter”  at  room  temperature  imtil  Porter  Lab  staff  arrives  to  perform  further 
aliquotting.  DO  NOT  FREEZE  THE  PORTER  LAB’S  ALIQUOT. 

•  Hand  off  the  Porter  Lab’s  aliquot  to  Porter  Lab  staff  for  further  aliquotting. 

•  Store  remaining  Qiagen  method  aliquots  in  the  freezer,  in  a  box  labeled  “SHARE 
Qiagen  W-D”,  and  all  aliquots  resulting  from  the  phenol  chloroform  method 
extraction  in  a  freezer  box  labeled  “SHARE  Phenol  W-D”. 

•  Notify  CERC  Study  Manager  (667-5044)  when  CERC’s  tubes  are  ready  for  pick-up 
and  transfer  to  their  own  freezers  for  long-term  storage. 

4.  RECEIPT  /  STORAGE 

a)  Porter  Lab  Research  Tech  will  be  present  when  the  extractions  take  place  so  that  the 
Porter  Lab’s  5-pg  aliquot  for  each  participant  can  be  picked  up  at  room  temperature  and 
re-aliquotted  by  them  into  5  separate  microcentrifuge  tubes,  which  they  then  label  with 
the  DNA  Aliquot  ID,  concentration  (ng/pg),  current  date,  and  the  Porter  Lab’s  “Lab 
number”. 

b)  When  aliquotting  of  all  extracted  material  is  finished,  the  completed  “SHARE  DNA 
Tracking  Log  Forms”  and  the  Porter  Lab's  aliquots  are  taken  back  to  the  Porter  Lab  for 
“check  in”  and  storage.  Discrepancies  between  the  imported  list  of  Buffy  Coat  ID’s  and 
those  appearing  on  the  Forms  are  noted  and  resolved  after  investigation  by  Porter  Lab 
and  CERC  Study  Manager,  and  records  are  updated  accordingly  if  necessary.  The 
“SHARE  DNA  Tracking  Log  Forms”  are  photocopied,  and  the  copies  are  filed  in  the 
Porter  Lab  by  date  of  extraction.  The  original  Forms  are  sent  to  CERC  Study  Manager  for 
data  entry  and  storage. 

c)  The  two  boxes  containing  CERC’s  aliquots,  which  are  temporarily  stored  in  the 
Specimen  Processing  Lab’s  freezers,  are  picked  up  by  CERC  staff  and  transported  by 
hand  back  to  their  own  freezers.  They  are  then  put  into  long  term  storage  there  while 
testing  methods  are  being  developed  for  further  study,  and  for  archival  purposes  in  case 
additional  DNA  is  needed  by  the  Porter  Lab  for  testing. 


appendix  2 


American  Journal  of  Pathology,  Vol.  160,  No.  1,  January  2002 
Cc^yyright  ©  American  Society  for  Investigative  Pathology 


Technical  Advance 

Single  Nucleotide  Polymorphism  Array  Analysis  of 
Flow-Sorted  Epithelial  Cells  from  Frozen  Versus 
Fixed  Tissues  for  Whole  Genome  Analysis  of  Allelic 
Loss  in  Breast  Cancer 


Elizabeth  L.  Schubert,*  Li  Hsu,’*^ 
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Cancer  Research  Center,  Seattle,  and  the  Departments  of 
Gastroenterology*  and  Pathology^  University  of  Washington, 
Seattle,  Washington 


Analysis  of  allelic  loss  in  archival  tumor  specimens  is 
constrained  by  quality  and  quantity  of  tissue  and  by 
technical  limitations  on  the  number  of  chromosomal 
sites  that  can  be  efficiently  evaluated  in  conventional 
analyses  using  polymorphic  microsatellite  markers. 
Newly  developed  array-based  assays  have  the  poten¬ 
tial  to  yield  genome-wide  data  from  small  amounts  of 
tissue  but  have  not  been  validated  for  use  with  rou¬ 
tinely  processed  specimens.  We  used  the  Affymetrix 
HuSNP  assay,  composed  of  1494  single  nucleotide 
polymorphism  sites,  to  compare  allelic  loss  results 
obtained  from  both  formalin-fixed  and  frozen  breast 
tissue  samples.  Tumor  cells  were  separated  from  nor¬ 
mal  epithelia  and  nonepithelial  cells  by  dissection 
and  bivariate  cytokeratin/DNA  flow  sorting;  normal 
breast  cells  from  the  same  patient  served  as  constitu¬ 
tive  normal.  Allele  results  from  the  HuSNP  array  av¬ 
eraged  96%  reproducibility  between  duplicates  and 
were  concordant  between  the  fixed  and  frozen  nor¬ 
mal  samples.  We  also  analyzed  DNA  from  the  same 
samples  after  whole-genome  amplification  (primer 
extension  preamplification).  Although  overall  signal 
intensities  were  lower,  the  genotype  data  from  the 
primer  extension  preamplification  material  was  con¬ 
cordant  with  genomic  DNA  data  from  the  same  sam¬ 
ples.  Results  from  genomic  normal  tissue  DNA  aver¬ 
aged  informative  single  nucleotide  polymorphism  at 
379  (25%)  loci  genome-wide.  Although  data  points 


were  clustered  and  some  segments  of  chromosomes 
were  not  informative,  our  data  indicated  that  the 
Aflymetrix  HuSNP  assay  could  provide  an  efficient 
and  valid  genome-wide  analysis  of  allelic  imbalance 
in  routinely  processed  and  whole  genome-ampli¬ 
fied  pathology  specimens.  (Am  J  Pathol  2002, 
160:73-79) 

Loss  of  heterozygosity  (LOH),  or  allelic  loss,  is  one  of  the 
most  frequent  genetic  abnormalities  in  breast  cancer.  It 
may  serve  as  a  marker  of  generalized  genomic  instability, 
and  when  frequently  observed  in  a  region,  it  is  consid¬ 
ered  indirect  evidence  for  the  presence  of  a  tumor  sup¬ 
pressor  gene  within  that  region  of  loss.  In  sporadic  breast 
cancer,  allelic  loss  at  multiple  chromosomal  locations  has 
been  identified  in  a  range  of  invasive  and  preinvasive 
breast  cancers  as  well  as  benign  and  normal  breast 
epithelium  adjacent  to  tumor.'’"®  However,  a  complete 
evaluation  of  LOH  in  breast  cancer  has  been  hampered 
by  the  limited  number  of  polymorphic  markers  available 
for  study;  the  heterogeneity  of  breast  tissue  (mixed  non- 
tumor  and  tumor  cells);  the  lack  of  sufficient  numbers  of 
fresh  or  frozen  samples  with  associated  demographic  or 
clinical  data,  and  the  small  amount  of  tissue  available 
from  currently  diagnosed  breast  cancers.  To  address 
these  limitations,  we  used  flow  cytometry  to  select  and 
purify  tumor  cells  from  routinely  processed  tissue  blocks, 
whole  genome  amplification  to  increase  the  amount  of 
DNA  available  for  study,  and  a  microarray  assay  to  as¬ 
sess  ail  chromosomes  efficiently  and  simultaneously. 

The  newly  developed  Affymetrix  HuSNP  array,  which 
contains  1494  single  nucleotide  polymorphism  (SNP) 
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sites  genome-wide  and  requires  only  135  ng  of  genomic 
DNA  (gDNA)  per  assay,  is  a  potential  platform  for  evalu¬ 
ating  genome-wide  genetic  analysis  of  breast  tissue.  The 
usefulness  of  a  prototype  SNP  array  and  the  current 
HuSNP  array  for  analysis  of  allelic  loss  in  fresh  lung 
tumors  removed  at  autopsy  and  fresh  biopsies  from 
esophageal  cancers,  respectively,  has  been  previously 
described.'*'®  However,  the  analysis  of  formalin-fixed, 
paraffin-embedded  pathology  specimens  by  the  com¬ 
mercially  available  HuSNP  assay  has  not  been  reported. 

Here  we  discuss  the  use  of  the  HuSNP  to  examine 
allelic  imbalance  in  both  frozen  and  fixed  pathology 
specimens  and  compare  results  between  the  two  pres¬ 
ervation  methods.  To  purify  populations  of  cells  from  the 
tissue  for  analysis  we  used  bivariate  flow  cytometry, 
which  allowed  us  to  sort  tumor  cells  for  analysis  based  on 
positive  cytokeratin  staining  and  gDNA  content.®  In  ad¬ 
dition  to  gDNA,  we  also  examined  the  use  of  a  polymer¬ 
ase  chain  reaction  (PCR)-based  whole-genome  amplifi¬ 
cation  method,  primer-extension  preamplification  (PEP) 
that  increases  the  amount  of  template  available  for  anal¬ 
ysis  ~30-fold’’'®  and  compared  allelic  loss  results  from 
the  PEP  product  to  results  with  gDNA.  HuSNP  allelic  loss 
results  were  also  compared  to  results  from  conventional 
polymorphic  microsatellite  markers  (short  tandem  re¬ 
peats  or  STRs)  on  chromosomes  11  and  17. 


Materials  and  Methods 
Tissue  Samples 

Tumor  and  normal  tissue  from  two  breast  cancer  patients 
were  obtained  from  the  University  of  Washington  tissue 
bank  with  patient  consent  and  in  compliance  with  the 
Institutional  Review  Board.  Samples  taken  at  the  time  of 
surgery  were  divided  into  two  portions  and  each  portion 
was  processed  routinely  either  by  freezing  in  OCT  media 
or  formalin  fixation  followed  by  paraffin  embedding.  No 
gross  difference  was  apparent  between  the  portions  se¬ 
lected  for  either  preservation  method.  The  presence  of 
tumor  in  each  block  was  confirmed  microscopically.  A 
formalin-fixed  tissue  block  from  each  of  the  cases  was 
tested  for  estrogen  receptor,  progesterone  receptor,  c- 
erbB2  oncogene  protein,  and  p53  tumor  suppressor 
gene  protein  by  immunohistochemistry  as  previously  de¬ 
scribed,® 


Flow  Cytometry 

Flow  cytometry  was  performed  on  the  frozen  and  fixed 
samples  to  purify  tumor  cells  from  normal  epithelia  and 
nonepithelial  cells.  Hematoxylin  and  eosin  (H&E)-stained 
slides  from  both  frozen  and  paraffin-embedded  tumor 
sections  were  examined  to  confirm  that  the  samples  con¬ 
tained  tumor  epithelium.  Similarly,  H&E  slides  taken  from 
the  normal  block  confirmed  that  the  sample  contained  no 
tumor. 

From  each  frozen  breast  tissue  sample,  20  to  50 
50-/Lim  sections  were  cut  and  placed  into  phosphate- 


buffered  saline  containing  1%  bovine  serum  albumin 
(PBA).*° 

The  samples  were  mechanically  disaggregated  and 
washed  in  PBA.  The  resulting  cell  suspensions  were  fixed 
in  0.5%  electron  microscopy  grade  formaldehyde  and 
permeabilized  in  0.1%  triton/PBA  before  staining.  From 
formalin-fixed  tissue  blocks,  flow  cytometry  preparation 
was  performed  as  described,®  Briefly,  1  to  20  GO-fj-m 
sections  were  cut  from  normal  and  tumor  tissue  blocks; 
regions  of  tumor  in  each  section  were  dissected  from 
surrounding  tissue  with  a  scalpel  blade.  All  sections  were 
deparaffinized,  rehydrated,  and  digested  in  collagenase 
before  a  brief  pepsin  digestion. 

Cell  suspensions  from  both  the  frozen  and  fixed  sam¬ 
ples  were  stained  with  4,6-diamidino-2-phenylindole  and 
R-phyco-erythrin  labeled  AE1/AE3  (Roche,  Indianapolis, 
IN),  which  recognizes  a  wide  variety  of  acidic  and  basic 
cytokeratins.  A  parallel  sample  of  cells  was  stained  with 
R-PE-labeled  isotype-matched  mouse  Ig  (R-PE  labeled 
IgGI;  DAKO,  Carpinteria,  CA)  and  used  as  a  negative 
control.  Before  sorting,  all  samples  were  forced  through  a 
25-gauge  needlelO  times  to  ensure  a  single  cell  suspen¬ 
sion. 

Cytokeratin-positive  tumor  cells  were  sorted  by  bivari¬ 
ate  analysis  with  488  nm  and  UV  excitation  on  a  Becton 
Dickinson  (Mountain  View,  CA)  FACS  Vantage.  R-PE, 
cytokeratin-positive  populations  were  sorted  based  on 
their  4,6-diamidino-2-phenylindole-fluorescent  DNA  con¬ 
tent,  expressed  as  DNA  index  (D1  =  mean  aneuploid 
fluorescence/mean  diploid  G^  fluorescence).  Cells  from 
the  normal  blocks  were  processed  and  stained  similarly 
to  the  tumor  samples.  The  DNA  from  all  cells  in  the  normal 
blocks  was  used  as  the  constitutive  normal  for  compari¬ 
son  with  the  tumor  cell  DNA. 

Preparation  of  DNA  Samples 

DNA  was  extracted  from  frozen  cells  using  the  Puregene 
DNA  isolation  kit  (Centra  Systems,  Minneapolis,  MN), 
following  the  manufacturer's  suggestions  with  the  addi¬ 
tion  of  1  jxl  of  20  mg/ml  of  Proteinase  K  to  the  cell  lysis 
buffer,  followed  by  incubation  at  50°C  for  1  to  16  hours. 
DNA  was  extracted  from  fixed  cells  using  a  simple  Pro¬ 
teinase  K  digestion  method  previously  described.'’*  Ex¬ 
tracted  DNA  samples  were  quantified  using  the  Pi- 
cogreen  dsDNA  Quantitation  Kit  (Molecular  Probes, 
Eugene,  OR)  on  the  Cytofluor  II  Fluorescence  Multiwell 
Plate  Reader  (PerSeptive  Biosystem  Inc.,  Framingham, 
MA). 

Whole  genome  amplification  using  the  primer  exten¬ 
sion  protocol  (PEP)  was  performed  as  described.®  For 
each  gDNA  sample,  six  individual  PEP  reactions,  each 
using  7  ng  of  gDNA  as  template,  were  performed  and  the 
PEP  material  pooled.'’^  PEP  material  was  used  directly  in 
the  array  protocol  without  purification  or  alteration  of  con¬ 
centration. 

STR  Protocol 

Twenty-four  polymorphic  repeat  loci  (STRs)  on  chromo¬ 
somes  11  and  17  were  amplified  using  fluorescent  prim- 
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ers  with  PEP  template.  Chromosomes  11  and  17  were 
selected  for  allelic  loss  comparison  between  arrays  and 
conventional  repeat  markers  because  both  contain  sites 
that  are  frequently  lost  in  breast  cancer.®'®  Markers  were 
selected  from  those  commercially  available  from  Re¬ 
search  Genetics  {www.resgen.com)  to  obtain  a  survey  of 
sites  that  corresponded  as  closely  as  possible  to  the 
HuSNP  sites  along  the  chromosomes.  The  physical  loca¬ 
tions  of  the  markers  in  Mb  are  listed  in  Figure  1,  B  and  C, 
as  given  by  National  Center  for  Biotechnology  Informa¬ 
tion  in  July  2001  {www.ncbi.nlm.nih.gov).  Primers  for  chro¬ 
mosome  11  were  (llptel)  D11S1397,  D11S2368, 
D11S2001,  D11S1918,  D11S1395,  (cen),  D11S4076. 
D11S1394,  D11S4151,  D11S2360,  (11qtel).  Those  for 
chromosome  17  were  (17ptel),  D17S919,  D17S1298, 
D17S1537,  TP53,  D17S786,  D17S1541,  D17S974, 
D17S975,  (cen),  D17S1293,  D17S1158,  D17S1294, 
D17S1185,  D17S1305,  D17S1290,  D17S1288,  {17qtel). 
PCR  reactions  were  performed  using  standard  protocols 
with  PEP  material  as  a  template.  PCR  reaction  products 
were  multiplexed  and  then  purified  using  Microcon-100 
columns,  after  which  the  DNA  was  resuspended  in  sterile 
water.  Reactions  were  run  on  an  ABI  377  and  analyzed 
using  ABI  Prism  Gene  Scan  software. 

To  evaluate  allelic  loss  for  each  marker''®'''®  the  peak 
height  of  the  first  allele  was  divided  by  the  peak  height  of 
the  second  allele  to  obtain  the  allelic  ratio  (AR).  Samples 
were  deemed  informative  at  a  locus  if  the  AR  for  the 
normal  tissue  sample  was  sufficiently  close  to  1  (defined 
operationally  as  0.7  <  AR  <  1/0.7).  For  informative  loci, 
an  index  Q  was  computed  as  the  AR  of  the  tumor  tissue 
sample  divided  by  the  AR  of  the  normal  tissue  sample.'"' 
A  locus  was  scored  as  having  LOH  if  the  Q  value  was 
sufficiently  far  from  1  (defined  operationally  as  either  Q  < 
0.3  or  Q  >  1/0.3).  A  locus  was  scored  as  retaining  het¬ 
erozygosity  if  the  Q  value  was  sufficiently  close  to  1 
(defined  operationally  as  0.7  <  Q  <  1/0.7). 

HuSNP  Protocol 

The  Affymetrix  HuSNP  protocol  was  performed  according 
to  manufacturer’s  instructions  and  as  described.''  Each 
individual  gDNA  sample  from  both  cases  was  analyzed 
twice  in  completely  separate  reactions,  to  yield  data  from 
a  total  of  1 8  HuSNP  arrays  for  the  two  cases  (five  samples 
from  case  1  in  duplicate  plus  four  samples  from  case  2  in 
duplicate).  Similarly,  the  PEP  material  from  each  sample 
was  analyzed  by  HuSNP  in  duplicate  (18  HuSNP  arrays). 
Data  analysis  using  the  Affymetrix  Genechip  software 
resulted  in  genotype  calls  that  were  used  in  the  statistical 
analysis.  The  genetic  map  used  in  the  analysis  came 
from  Affymetrix,  release  date  June  2001 . 

Statistical  Methods 

To  quantify  the  reproducibility  of  the  HuSNP  chips,  the 
reliability  measure  was  calculated.  The  reproducibility  for 
making  a  consistent  genotype  call  was  defined  as  the 
number  of  SNPs  with  the  same  genotype  calls  from  both 
replicates  divided  by  the  total  number  of  SNPs  for  which 


both  replicates  yielded  signal  calls.  The  reproducibility 
for  making  no-signal  calls  was  also  calculated,  and  was 
defined  as  the  number  of  SNPs  for  which  both  replicates 
yielded  no-signal  calls  divided  by  the  total  number  of 
SNPs  for  which  at  least  one  replicate  yielded  a  no-signal 
call. 

Similarly,  concordance  of  genotype  and  no-signal  calls 
were  measured  between  frozen  and  fixed  tissue  samples 
as  well  as  between  gDNA  and  PEP  samples.  Because 
each  sample  was  analyzed  in  duplicate,  there  were  a 
total  of  four  possible  comparisons  between  each  set  of 
fixed  and  frozen  samples.  The  concordance  measure 
was  calculated  by  the  ratio  of  the  average  over  the  four 
comparisons  of  the  number  of  SNPs  with  same  genotype 
calls  from  both  samples  and  the  average  over  the  four 
comparisons  of  the  number  of  SNPs  for  which  both  sam¬ 
ples  yielded  signal  calls.  The  concordance  measure  for 
no-signal  calls  was  calculated  similarly. 

The  informativlty  and  allelic  loss  of  the  SNPs  was  ex¬ 
amined  for  both  cases.  We  defined  a  SNP  as  informative 
when  one  normal  tissue  replicate  of  the  SNP  was  het¬ 
erozygous  (AB)  and  the  other  replicate  was  either  het¬ 
erozygous  or  had  no  signal.  We  defined  a  SNP  site  as 
having  allelic  loss  when  that  SNP  was  informative  in  the 
normal  tissue,  one  tumor  tissue  replicate  of  the  SNP  was 
hemizygous  or  homozygous  (AA  or  BB),  and  the  other 
replicate  was  either  hemizygous,  homozygous,  or  no  signal. 

Ail  statistical  analyses  were  performed  using  SPLUS 
statistical  software  (S-PLUS  Reference  Manual,  version 
3:2;  Statistical  Sciences  I,  Seattle,  Washington). 


Results 

Subject  and  Tumor  Characteristics 

The  results  of  the  pathology  review  and  immunohisto- 
chemical  assays  from  the  two  cases  used  in  this  study 
are  shown  in  Table  1.  The  patients'  ages  were  similar  at 
their  respective  times  of  diagnosis.  Flow  cytometric  anal¬ 
ysis  revealed  multiple  aneuploid  cell  populations  in  the 
tumor  from  case  1.  One  cell  population  from  the  fixed 
tumor  and  one  from  the  frozen  tumor  had  very  similar  DIs 
(1.49  and  1.43,  respectively),  whereas  an  additional  cell 
population  seen  only  in  the  fixed  portion  of  the  tumor  had 
a  distinct  Dl  of  1 .82.  Case  2  had  a  single  aneuploid  tumor 


Table  1.  Patient  and  Tumor  Characteristics  of  the  Two 
Breast  Cancer  Cases 


Case  1 

Case  2 

Histologic  type 

Lobular 

Ductal 

Age  at  diagnosis 

48 

46 

AJCC  stage 

illA 

IIA 

ER  protein  status 

Negative 

Positive 

PR  protein  status 

Negative 

Positive 

c-erb-2  protein  status 

Negative 

Positive 

P53  protein  status 

Positive 

Negative 

DNA  index 

Multiple  aneuploid 

Aneuploid 

DNA  index — ^frozen  tumor 

1.43 

1.79 

DNA  index — ^fixed  tumor 

1.49  and  1.82 

1.76 

ER,  estrogen  receptor;  PR,  progesterone  receptor. 
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Figure  1.  Graphical  representation  of  HuSNP  and  STR  allelic  loss  data  for  chromosomes  6  (A),  11  (B),  and  17  (C)  from  both  patients.  SNP  data  are  depicted  as 
squares  and  STR  data  are  indicated  by  circles.  Shown  are  the  data  from  all  five  cell  populations  isolated  by  bivariate  flow  cytometry  in  the  two  patients’  tumors. 
SNP  loci  that  were  uninformative  in  both  the  frozen  and  fixed  material  from  a  patient  were  removed  from  this  diagram.  White  squares  and  circles  indicate  allelic 
loss,  black  square  and  circles  indicate  retention,  and  gray  squares  and  circles  indicate  lack  of  informativity  or  data  (in  the  STR  markers)  or  no  signal  calls  (in 
the  HuSNP).  Markers  are  arranged  on  the  diagram  to  represent  their  physical  distribution  on  the  chromosome. 


Use  of  SNP  Array  for  LOH  in  Breast  Pathology  Specimens  77 

AJP  January  2002,  Vol.  160,  No.  1 


Table  2.  Reproducibility  of  HuSNP  Calls  for  Each  Sample  Type* 


Frozen  normal 

Frozen  tumor 

Fixed  normal 

Fixed  tumor 

Definite 

No  signal 

Definite 

No  signal 

Definite 

No  signal 

Definite 

No  signal 

calls'* 

calls 

calls 

calls 

calls 

calls 

calls 

calls 

gDNA 

98% 

72% 

96% 

65% 

98% 

75% 

95% 

75% 

PEP 

97% 

75% 

95% 

67% 

95% 

70% 

93% 

79% 

•The  average  percent  of  definite  (AA,  AB,  and  BB)  and  no  signal  HuSNP  genotype  calls  that  were  identical  between  independent,  duplicate 
analyses  of  each  sample  type. 

+AA,  AB,  or  BB  calls. 


cell  population  distinguishable  in  both  the  fixed  (Dl  = 
1.76)  and  frozen  (Dl  =  1.79)  tissue  samples.  All  three 
tumor  cell  populations  from  case  1  and  both  from  case  2 
were  tested  independently  and  included  in  the  subse¬ 
quent  array  analysis  using  both  gDNA  and  PEP  material 
from  these  cases. 

HuSNP  Analysis 

For  each  SNP  site  on  the  chip,  genotype  results  from  the 
Affymetrix  Genechip  software  were  reported  as  definite 
calls  (AA,  AB,  BB),  no  signal,  or  an  intermediate  call 
(AB_A  or  AB-B),  The  Genechip  software  does  not  score 
allele  copy  number  but  instead  always  indicates  two 
alleles  (AA  and  BB).  The  HuSNP  chip  contains  1494 
individual  SNP  sites,  however  our  experience  was  similar 
to  that  of  a  previous  report, in  that  more  than  100  of  the 
1494  sites  on  the  chip  consistently  failed,  yielding  most  of 
the  no-signal  calls.  Intermediate  calls  were  rare,  seen  in 
~1%  of  sites  in  each  assay.  The  reproducibility  statistics 
for  definite  calls  and  no-signal  calls  between  duplicate 
assays  using  the  same  gDNA  or  PEP  sample  are  shown 
in  Table  2.  Table  3,  A  and  B,  show  the  concordance  of 
definite  and  no-signal  results  between  the  fixed  and  fro¬ 
zen  gDNA  and  PEP  samples  from  each  case  (Table  3A) 
as  well  as  the  concordance  between  the  gDNA  and  PEP 
results  (Table  3B).  A  graphical  representation  of  concor¬ 


dance  between  fixed  and  frozen  gDNA  samples  on  chro¬ 
mosomes  6,  11,  and  17  are  presented  in  Figure  1. 

The  allelic  loss  results  for  each  chromosome  and 
gDNA  sample  source  are  shown  in  Table  4.  PEP  results 
were  similar  to  the  gDNA,  as  indicated  by  the  genotype 
concordances  shown  in  Table  3B.  Informativity  varied 
slightly  between  the  fixed  and  frozen  samples  and  re¬ 
sulted  in  some  differences  in  allelic  loss  results  between 
the  paired  samples  as  shown  in  Table  4.  This  variance 
was  primarily  because  of  no-signal  calls  at  a  particular 
SNP  site  in  one  sample  type  or  the  other  and  not  to  actual 
differences  in  calls  between  the  fixed  and  frozen  sam¬ 
ples.  The  exception  to  this  is  in  case  1  that  contained 
more  than  one  aneuploid  population  of  tumor  cells  (Table 
1).  Allelic  losses  were  reproducible  in  the  duplicate  anal¬ 
yses  and  generally  concordant  with  adjacent  sites  in 
large  regions  along  chromosomes,  as  is  shown  visually 
for  chromosomes  6,  11,  and  17  in  Figure  1. 

Comparison  of  HuSNP  and  STR  Analyses 

Of  the  24  STR  markers  analyzed  on  chromosomes  1 1  and 
17,  there  were  a  total  of  69  informative  sites  between  all 
five  tumor  populations  identified  in  the  two  patients  (see 
Figure  1,  B  and  C,  for  details).  Of  these  69  sites,  60 
showed  correlation  with  data  from  adjacent  HuSNP  mark¬ 
ers.  However,  at  the  nine  STR  sites  that  do  not  correlate 


Table  3A. 


The  Concordance'*'  of  Definite  and  No  Signal  Calls  between  the  Duplicate  Analyses  of  Frozen  and  Fixed  Samples  from 
Each  of  Two  Breast  Cancer  Cases 


Case  1 


Case  2 


Normal  tissue  Tumor  tissue  Normal  tissue  Tumor  tissue 

Definite  No  signal  Definite  No  signal  Definite  No  signal  Definite  No  signal 
calls*  calls  calls  calls  calls  calls  calls  calls 


gDNA  1162(97%)*  185(64%)  1071  (92%)  199(61%)  1150(93%)  182(71%)  1019(95%)  189(45%) 

PEP  878(92%)  543(36%)  766(87%)  288(47%)  1131(93%)  185(68%)  919(95%)  196(37%) 


Table  3B.  Concordance*  of  gDNA  and  PEP  DNA  Data  for  Each  Sample  Type  in  Both  Cases 


Frozen  normal  Frozen  tumor  Fixed  normal  Fixed  tumor 


Definite 

calls* 

No  signal 
calls 

Definite 

calls 

No  signal 
calls 

Definite 

calls 

No  signal 
calls 

Definite 

calls 

No  signal 
calls 

1004  (95%) 
1183  (98%) 

21 1  (49%) 
162  (57%) 

1006(92%) 
1166  (97%) 

219  (54%) 
167  (58%) 

909(91%) 

1166(97%) 

188  (38%) 
167  (58%) 

1200(98%) 
825  (93%) 

207  (78%) 
263  (43%) 

*AA,  AB,  or  BB  calls. 

■•■The  concordance  of  genotype  calls  between  frozen  and  fixed  material  from  each  of  the  cases.  The  numbers  are  the  number  of  sites  average  over 
replicates  that  gave  concordant  genotypes  out  of  the  1 494  sites  on  the  HuSNP  array. 

^Percentages  are  the  average  percent  of  concordant  sites  out  of  those  with  that  type  of  call.  See  text  for  a  description  of  how  concordance  was 
calculated. 
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Table  4.  Number  of  Informative  SNPs  and  LOH  by  Chromosome,  Identified  in  the  Fixed  and  Frozen  Samples  from  Two  Cases  of 
Breast  Cancer 


Chromosome 

Case  1 

Case  2 

Frozen  tissue 
LOH'''/informative* 

(Dl  =  1.43) 

Fixed  tissue 

LOH/informative  LOH/informative 

(Dl  =  1.49)  (Dl  =  1.82) 

Frozen  tissue 
LOH/informative 
(Dl  =  1.79) 

Fixed  tissue 
LOH/informative 
(Dl  =  1.76) 

1 

1/27 

0/26 

2/26 

0/32 

0/28 

2 

8/26 

10/25 

9/25 

1/29 

1/23 

3 

12/26 

11/28 

13/28 

0/36 

0/25 

4 

11/16 

3/16 

12/16 

0/22 

0/15 

5 

1/17 

1/17 

1/17 

0/14 

0/10 

6 

20/33 

19/35 

21/35 

0/27 

1/26 

7 

1/16 

1/16 

2/16 

0/21 

0/19 

8 

0/33 

0/33 

1/33 

18/36 

15/27 

9 

5/22 

5/22 

4/22 

0/17 

0/15 

10 

7/10 

8/12 

9/12 

0/20 

1/15 

11 

17/20 

0/21 

6/21 

5/23 

4/22 

12 

2/18 

1/18 

3/18 

0/6 

0/6 

13 

0/14 

0/11 

1/11 

0/8 

0/5 

14 

0/9 

0/9 

0/9 

1/13 

1/10 

15 

0/16 

0/17 

1/17 

0/16 

0/14 

16 

0/11 

0/12 

1/12 

6/11 

6/8 

17 

19/20 

18/21 

21/21 

6/15 

6/13 

18 

2/13 

3/13 

4/13 

7/10 

4/7 

19 

0/16 

0/17 

1/17 

7/17 

1/11 

20 

0/5 

0/6 

0/6 

0/9 

0/6 

21 

0/6 

0/6 

0/6 

0/8 

0/5 

22 

0/6 

0/8 

1/8 

5/5 

5/5 

X 

0/5 

0/6 

5/6 

0/7 

0/7 

Unmapped 

0/2 

1/3 

1/3 

0/4 

0/2 

Total 

106/387  (27%) 

81/398  (20%) 

119/398(30%) 

56/406  (14%) 

45/324  (14%) 

*SNP  sites  where  the  normal  sample  genotype  call  was  AB  (see  Materials  and  Methods  text). 

■•■SNP  sites  which  were  informative  in  the  normal  and  had  either  AA  or  BB  genotype  calis  in  the  tumor  sample  (see  Materials  and  Methods  text). 


with  adjacent  HuSNP  markers,  it  is  difficuit  to  determine 
whether  the  apparent  discordance  is  because  of  techni¬ 
cal  limitations  or  if  the  STR  marker  is  recognizing  a  small 
region  with  a  different  allelic  loss  pattern  than  the  adjacent 
regions  scored  by  SNP.  At  four  of  the  nine  sites  (D1 1S1394 
and  D17S1288  in  case  1,  and  D17S1294  in  case  2),  there 
was  at  least  a  5-Mb  distance  between  the  STR  and  SNP 
markers,  which  may  be  the  reason  for  the  discrepancy. 


Discussion 

This  study  examined  the  feasibility  of  using  array  tech¬ 
nology,  specifically  the  commercially  available  Affymetrix 
HuSNP  array,  for  genome-wide  allelic  loss  analysis  of 
both  fixed  and  frozen  breast  pathology  specimens.  Ar¬ 
chival  pathology  specimens  are  a  valuable  resource  for 
the  genetic  analysis  of  tumors.  However,  the  limited 
quantity  and  quality  of  DNA  available  is  a  serious  limita¬ 
tion  for  genetic  analysis  of  such  specimens.  The  quality  of 
DNA  obtained  from  pathology  specimens  is  compro¬ 
mised  by  routine  preservation  methods  that  were  neither 
designed  for,  nor  are  optimal  for,  DNA  preservation.  For¬ 
malin,  the  most  commonly  used  fixative  for  pathology 
tissue  specimens,  has  been  shown  to  reduce  the  size  of 
PCR  segments  that  may  be  amplified  from  a  sample.''®  In 
our  experience  as  well  as  in  reports  from  the  literature, 
DNA  extracted  from  paraffin-embedded  tissues  most  re¬ 
liably  yields  PCR  results  in  small  amplicons,  often  under 
200  nucleotides.''®’''®  Tissues  frozen  in  OCT  media  for 


frozen  section  diagnosis  suffer  less  of  a  direct  insult  to 
DNA  quality  but  are  still  subject  to  handling  and  storage 
exposures  that  may  result  in  DNA  fragmentation.  Ideally, 
a  genomic  analysis  technique  for  pathology  specimens 
would  maximize  the  data  obtained  from  nanogram  quan¬ 
tities  of  low-molecular  weight  DNA.  Our  study  sought  to 
validate  array  technology  such  as  that  used  in  the  HuSNP 
array  for  use  with  such  specimens. 

In  this  study,  samples  were  analyzed  in  duplicate  to 
generate  reliability  statistics  for  each  type  of  sample,  and 
genotype  data  were  compared  between  fixed  and  frozen 
samples  to  examine  the  data  concordance  between  sam¬ 
ple  types.  The  HuSNP  array  yielded  genotype  results  that 
were  reliable  and  concordant  for  both  fixed  and  frozen 
tumor  and  normal  breast  pathology  specimens.  Impor¬ 
tantly,  the  DNA  fragmentation  that  occurs  with  formalin 
fixation  does  not  seem  to  affect  HuSNP  results,  presum¬ 
ably  because  the  assay  relies  on  PCR  amplicons  that  are 
shorter  than  100  nucleotides  in  length. 

In  addition  to  analysis  of  genomic  DNA  extracted  from 
these  specimens,  we  also  examined  the  data  obtained 
from  whole  genome  amplified  material  (PEP)  generated 
from  our  specimens,  and  found  similar  reliability  for  either 
genomic  DNA  and  PEP  genotypes  when  analyzed  by 
HuSNP.  The  concordance  was  similarly  high  for  both 
genomic  and  PEP  DNA,  although  slightly  lower  for  the 
PEP  material.  This  lower  concordance  was  primarily  be¬ 
cause  of  an  increase  in  no-signal  genotype  calls  seen  in 
the  PEP  material  versus  the  genomic  DNA  and  an  indica- 
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tor  of  the  potential  data  not  obtained  with  amplified  DNA. 
However,  in  cases  in  which  sample  Is  limited,  using  the 
HuSNP  assay  on  PEP  material  may  be  an  acceptable 
approach  to  genome-wide  analysis.  In  cases  In  which  the 
original  sample  is  extremely  limited,  the  use  of  whole 
genome  amplification  may  make  analysis  possible. 

We  also  used  the  data  generated  from  the  two  cases  to 
examine  allelic  loss  in  the  cell  populations  isolated  from 
the  tumors  by  bivariate  flow  oytometry.  There  was  an 
average  of  379  informative  SNP  sites  throughout  the  ge¬ 
nome  from  all  of  the  gDNA  HuSNP  assays.  This  is  very 
similar  to  the  expected  distribution  of  heterozygosity  as 
defined  using  biallelic  SNP  markers’' and  in  previously 
reported  HuSNP  data.''  As  has  been  previously  report¬ 
ed,'*  allelic  loss  data  obtained  from  the  HuSNP  agreed 
well  with  data  obtained  by  the  more  standard  method  of 
microsatellite  (STR)  analysis. 

Although  the  fixed  and  frozen  samples  from  case  2 
yielded  highly  concordant  HuSNP  results  on  all  chromo¬ 
somes,  the  cell  populations  with  close  Dl  (1 .43  and  1.49, 
respectively)  identified  in  case  1  exhibited  substantial 
differences  in  LOH  on  chromosomes  4  and  1 1 .  Because 
the  differences  were  confined  to  these  two  chromo¬ 
somes,  the  data  were  not  likely  to  be  the  result  of  a 
general  cross-contamination,  but  rather  reflected  a  bio¬ 
logical  difference  between  these  cell  populations.  The 
second  population  with  a  Dl  of  1 .82,  identified  in  the  forma¬ 
lin-fixed  tissue  block  was  also  distinct  and  exhibited  a 
slightly  higher  frequency  of  allelic  loss  throughout  the  ge¬ 
nome.  A  diversity  of  cell  populations  is  common  within  ad¬ 
vanced  breast  tumors'®'^°  and  may  reflect  the  development 
of  distinct  genotypic  clones  with  different  behavior  potential. 
Flow  cytometric  analysis  can  initially  define  cell  populations 
with  DNA  content  differences  that  can  be  further  resolved 
by  genomic  analysis,  yielding  important  information  about 
tumor  composition  that  would  otherwise  be  obscure. 

The  gDNA  HuSNP  anaiysis  of  the  two  cases  included 
in  this  study  yielded  more  genome-wide  data  than  could 
be  obtained  with  a  similar  amount  of  DNA  by  other 
means,  such  as  microsatellite  marker  analysis.  However, 
it  is  still  a  low-density  map,  with  an  average  of  one  SNP 
site  per  8.5  Mb  in  the  genome.  Another  limitation  of  the 
current  HuSNP  array  is  that  many  SNP  sites  included  in 
the  assay  are  clustered,  so  that  many  regions  of  the 
genome  are  well  represented  whereas  others  are  under¬ 
represented.  Given  that  array  assays  for  genome-wide 
analyses  are  continuing  to  be  developed,  we  expect  that 
the  next  generation  of  genetic  marker  arrays  using  similar 
technology  as  the  HuSNP  will  provide  more  uniform  and 
higher  density  coverage  of  the  genome.  The  data  from 
this  study  indicate  that  future  array  technologies  will  be 
suitable  for  use  with  DNA  obtained  from  routinely  pro¬ 
cessed  pathology  specimens. 
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